Thread Batching for High-performance Energy-efficient GPU Memory Design
نویسندگان
چکیده
منابع مشابه
A High-Performance Hardware-Efficient Memory Allocation Technique and Design
This paper presents a hardware-efficient memory allocation (EMA) technique designed to eliminate both internal and external fragmentation that appear in the buddy system. EMA can allocate a free memory block of any size in any part of memory. Hardware implementation of EMA is introduced, but only part of its circuits is shown in the paper due to the space limitation. Simulation results show tha...
متن کاملDesign of Energy-Efficient High-Performance ASIP-DSP Platforms
In the last ten years, limited clock frequency scaling and increasing power density has shifted IC design focus towards parallelism, heterogeneity and energy efficiency. Improving energy efficiency is by no means simple and it calls for a reevaluation of old design choices in processor architecture, and perhaps more importantly, development of new programming methodologies that exploit the feat...
متن کاملAn Efficient LUT Design on FPGA for Memory-Based Multiplication
An efficient Lookup Table (LUT) design for memory-based multiplier is proposed. This multiplier can be preferred in DSP computation where one of the inputs, which is filter coefficient to the multiplier, is fixed. In this design, all possible product terms of input multiplicand with the fixed coefficient are stored directly in memory. In contrast to an earlier proposition Odd Multiple Storage ...
متن کاملCan PCM Benefit GPU? Reconciling Hybrid Memory Design with GPU Massive Parallelism for Energy Efficiency
In recent studies, phase changing memory (PCM) has shown promising energy efficiency for systems with a modest level of parallelism. But it remains an open question whether it can benefit GPU-like massively parallel systems. This work conducts the first systematic investigation into this question. It empirically shows that contrary to the promising results shown before on CPU, the previous desi...
متن کاملEncodings for High-Performance Energy-Efficient Signaling
Energy eÆciency, performance and signal integrity are conicting critical requirements for on-chip signaling. We propose a code-based solution that improves bit rate while reducing communication energy and preserving noise margins. Our technique is based on the observation that RC lines can be used at twice their limiting bit rate to transmit bit streams with no isolated bits. We propose new enc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Journal on Emerging Technologies in Computing Systems
سال: 2019
ISSN: 1550-4832,1550-4840
DOI: 10.1145/3330152